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ABSTRACT 

Binding of proteins to DNA is usually considered 1D 
with one protein bound to one DNA molecule. In 
principle, proteins with multiple DNA binding do- 
mains could also bind to and thereby cross-link dif- 
ferent DNA molecules. We have investigated this 
possibility using high-mobility group A1 (HMGA1) 
proteins, which are architectural elements of chro- 
matin and are involved in the regulation of multiple 
DNA-dependent processes. Using direct stochastic 
optical reconstruction microscopy (ofSTORM), we 
could show that overexpression of HMGA1a-eGFP 
in Cos-7 cells leads to chromatin aggregation. To 
investigate if HMGAIa is directly responsible for 
this chromatin compaction we developed a DNA 
cross-linking assay. We were able to show for the 
first time that HMGAIa can cross-link DNA directly. 
Detailed analysis using point mutated proteins re- 
vealed a novel DNA cross-linking domain. Electron 
microscopy indicates that HMGA1 proteins are 
able to create DNA loops and supercoils in linear- 
ized DNA confirming the cross-linking ability of 
HMGAIa. This capacity has profound implications 
for the spatial organization of DNA in the cell 
nucleus and suggests cross-linking activities for 
additional nuclear proteins. 

INTRODUCTION 

High-mobility group A (HMGA) proteins, which are con- 
sidered as architectural chromatin proteins, belong to a 
network of dynamic chromatin proteins that transiently 
reside on DNA/chromatin, fall off and rebind. Alike all 
HMG proteins they permanently scan the chromatin for 
available binding sites in a stop and go fashion thus 
allowing a flexible modulation of chromatin structure 
and function (1,2). Their binding to chromatin either 
promotes or suppresses DNA-dependent processes such 



as transcription, replication and DNA repair. 
Deregulated expression of HMGA proteins impairs gene 
expression, affects differentiation processes and causes 
several diseases (3-7). 

HMGA1 proteins play a key role in coordinating the 
assembly/disassembly of enhanceosomes, which need to be 
constructed on gene promoters to allow efficient transcrip- 
tion of many genes (4,5,8). Besides the modulation of local 
chromatin near regulatory DNA elements, HMGA 
proteins affect chromatin on a more global scale including 
chromatin loop formation, heterochromatin structure and 
chromosome condensation (2, 4). 

The HMGAla/b-splice variants and HMGA2-proteins, 
encoded by a separate gene, are characterized by three 
AT-hook DNA binding motifs, which preferentially bind 
to the minor groove of A/T-rich B-form DNA to both 
naked DNA and DNA on the nucleosomal surface 
(4,9,10). While a single AT-hook motif would be enough 
for binding, two AT-hook motifs considerably increase 
the binding of HMGA proteins to substrates and confer 
DNA binding with nanomolar affinity (9-14). The three 
individual AT-hooks do not contribute equally to the 
strength of binding due to variations in sequence and 
structure (9-11). An AT-hook consensus peptide 
comprised of 1 1 amino acids, and which is virtually iden- 
tical to the second AT-hook found in HMGA1 proteins is 
considered as high affinity type 1 AT-hook whereas the 
type 2 AT-hooks I and III bind with lower affinity (9,10). 
The affinity of AT-hook II is about one to two orders 
of magnitude greater than that of AT hook III (10). 
Amino acids flanking the AT-hooks as well as multiple 
modifications fine tunes the binding of HMGA proteins 
(2,9,10,15). Besides their preference for A/T-rich B-DNA, 
HMGA1 binding is independent of the DNA sequence 
and also recognizes peculiar structural features such as 
synthetic four-way junctions, bent DNA and supercoiled 
DNA (8,12,14,16). 

To gain more details about chromatin organization by 
HMGA1 proteins in the nucleus, we applied super- 
resolution fluorescence imaging using direct stochastic 
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optical reconstruction microscopy (rfSTORM) in fixed cells 
(17-19). The results suggest that HMGA1 proteins localize 
in multiple small domains evenly distributed throughout 
the cell nucleus. Interestingly, in cells overexpressing 
HMGA1 proteins these domains are not enlarged by ac- 
cumulation of additional HMGA1 molecules, but con- 
centrated in multiple domains accumulated in densely 
packed DNA. This observation suggested that HMGA1 
protein accumulation and DNA compaction correlate. 
The data assume that DNA clusters could be formed via 
local cross-linking of DNA through HMGA1 proteins. 

So far, binding of HMGA1 proteins to their substrates 
has been considered only in a ID way, meaning that one 
HMGA1 molecule binds to one DNA molecule containing 
multiple appropriately spaced A/T-tracts in close proxim- 
ity using its three DNA binding AT-hooks. To investigate 
DNA cross-linking activities, we developed a novel 
DNA-capture assay to investigate this putative function. 
The results demonstrate that HMGA1 proteins indeed are 
able to cross-link individual DNA fibers. Experiments 
with mutated HMGA1 proteins show that AT-hooks 
together with conserved basic amino acids located between 
AT-hooks II and III are essential to maintain this activity. 
Electron microscopy studies revealed that HMGA1 
proteins are able to create DNA networks, loops and 
coils in linearized DNA. Our findings imply that DNA 
binding proteins are able to cross-link DNA fibers and 
to create a DNA scaffold. 



MATERIALS AND METHODS 

rfSTORM microscopy 

Cos-7 cells were grown in Lab-Tek II chambered cover- 
glass in DMEM with 1 % PenStrep, 1 mM L-Glutamine 
and 10% fetal calf serum (Invitrogen) at 37°C and 5% 
C0 2 . Transient transfection of HMGAla-eGFP was per- 
formed using Effectene (Qiagen). Cells were washed in 
PBS, fixed in 2% formaldehyde/PBS for lOmin, permeab- 
ilized for lOmin in 0.1% Triton X-100/PBS, immersed in 
ice-cold methanol for 3min. Immunostaining was per- 
formed for 45min using affinity purified rabbit HMGA1 
antibodies (IG-1005, 1:150, Immunoglobe, Himmelstadt), 
mouse DNA antibodies (AC-30-10, 1:50, Progen, 
Heidelberg) and appropriate Alexa Fluor 647 secondary 
antibodies (Invitrogen). Cells were washed intensively 
overnight with excess of PBS, post-fixed with 2% formal- 
dehyde for 5-10 min and washed again with PBS. 

For (fSTORM imaging, Lab-Tek wells were filled with 
switching buffer, which is PBS (pH 7.4), containing 
oxygen scavenger [0.5mg/ml glucose oxidase (Sigma), 
40 ug/ml catalase (Roche Applied Science), 10% w/v 
glucose] and 50 mM mercaptoethylamine (Sigma). The 
rfSTORM setup, which contained an inverted microscope 
(Olympus IX-71) equipped with an oil-immersion object- 
ive (x60, NA 1.45; Olympus) was described previously 
(17-19). A 640 nm laser (Cube 640-100 C, Coherent) was 
used for excitation of Alexa 647 and a 488 nm laser 
(Sapphire 488 LP; Coherent) for eGFP. Photoswitching 
of Alexa Fluor 647 was performed with 0.05-0.5 kW/cm . 
A total of 10000-12 000 frames were measured with a 



frame rate of 60-100 Hz. The fluorescence light was col- 
lected by the objective, filtered by appropriate bandpass 
filters (Chroma ET700/75, HQ 535/50 AHF analys- 
entechnik AG), and imaged on an electron-multiplying 
charge-coupled device (EMCCD) camera (Ixon DV897; 
Andor). The images were analyzed and reconstructed 
with Rapid2STORM software (open source) as described 
by Wolter et al. (28). The fluorescence signals were 
analyzed applying asymmetry and photon thresholds, i.e. 
spots appearing too wide or too elliptical, as well as spots 
with less than 2000 or more than 6000 photons (HMGA1), 
or 300-2000 photons (DNA) per spot and frame were dis- 
carded from further analysis. It should be pointed out that 
the thresholds applied have no influence on the size of 
domains determined from reconstructed super-resolution 
images because each fluorophore is localized several times. 

Preparation of labeled DNA and cloning of constructs 

A major satellite repeat (satDNA) was amplified from 
murine genomic DNA by PCR using phusion polymerase 
(Finnzymes), cloned in pDrive (Qiagen) and sequenced. 

Terminally labeled satDNA with one molecule Cy5 or 
biotin (Cy5-satDNA or bio-satDNA, 563 bp) was pro- 
duced by PCR using Taq polymerase (NEB) and 5'-Cy5 
or 5'-biotinylated primers. Dual labeled Cy5/bio-satDNA 
was used to determine the binding efficiency to strepta- 
vidin. PCR products were quantified using a NanoQuant 
plate and a Tecan infinite M200 reader. 

Human HMGAla, HMGAlb, single- (R28G, R60G, 
R86G), double- (R28G R60G, R28G R86G, R60G 
R86G) or triple-mutated (R3xG) constructs were origin- 
ally produced by Harrer et al. (1). 

Truncated HMGAla lacking the C-terminal 17 (AL90), 
25 (AK82), 34 (AK71) or 43 (AG63) amino acids were 
produced by using appropriate primers. Point mutations 
K67G or RK73/74GG or K67G + RK73/74GG were cre- 
ated by site-directed mutagenesis. HMGN1 or HMGN2 
were cloned from murine cDNAs, respectively. 

All products were subcloned in pJet 2.1/blunt 
(Fermentas) cut with EcoRI and finally cloned in 
pET28a(+) (Novagen) and sequenced. Primers and se- 
quences can be found in the Supplementary Data. 

Protein preparation 

HMGA1 or HMGN proteins were expressed using 
Rosetta strain (Novagen) and induced with 0.5 mM 
IPTG at OD600 = 0.5. Extraction, purification and quan- 
tification of HMGA1 proteins were performed as desc- 
ribed by Reeves and Nissen (29). HMG proteins were 
quantified using a Jasco V-650 spectrophotometer. For 
relative quantification of HMG proteins, a Tecan infinite 
M200 reader and extinction at 230 nm was used. Protein 
purity was controlled in SDS-PAGE and Coomassie 
staining. 

DNA cross-linking assay 

Assays were performed in black 96-well plates pre-coated 
with streptavidin (Greiner bio-one). All incubations were 
at room temperature. Wells were pre-incubated with 
150 ul PBS for lOmin. Supernatants were carefully 
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removed by aspiration with a suction pump and a 
gel-loader tip. 500 ng of bio-satDNA were resuspended 
in 150 ul high-salt buffer (2M NaCl, lOmM Tris-HCl, 
pH 7.5, 1 mM EDTA) and incubated for 1 h. Wells were 
washed with 1 50 ul high-salt buffer and three times with 
freshly prepared PBST + (PBS, 0.1% Triton X-100, 0.3 M 
sucrose). Ten microgram of HMG protein was incubated 
in 150 ul PBST + on a roller shaker for 30min and 500 ng 
of Cy5-satDNA was added. The Cy5-satDNA/HMG 
solution was added to the washed wells and incubated 
overnight. After measuring the input Cy5 fluores- 
cence, wells were washed seven times with 1 50 ul PBST + 
and remaining Cy5 fluorescence was measured again. 
Measurements were performed using a Tecan infinite 
M200 reader. Excitation for Cy5-satDNA was set to 
641 nm and emission to 670 nm using a fixed gain. Each 
measurement was performed four times using five flashes. 
Relative fluorescence was calculated from mean absolute 
values (input Cy5 fluorescence/remaining fluorescence 
x 100%) and was defined as relative cross-linking effi- 
ciency (RCE). For statistical purposes, each experiment 
was at least reproduced four times. Statistical relevance 
was determined using a student's ?-test. Control A (Ctrl 
A) was performed incubating only Cy5-satDNA, control 
B (Ctrl B) was performed using Cy5-satDNA in wells con- 
taining precoupled bio-satDNA without any protein 
and control C (Ctrl C) was performed incubating Cy5- 
satDNA/protein in wells without precoupled bio- 
satDNA. Proteinase K treatment was performed after 
HMGAla cross-linking in PBST + by adding 50ng/ul pro- 
teinase K for 6h. 

Electron microscopy 

For DNA spreading experiments a pDrive vector contain- 
ing the major satellite DNA was linearized with Hindlll 
and gel extracted (Peqlab). Incubations were at room tem- 
perature. One hundred and fifty nanograms DNA and 
10 ug HMGAla or R3xG were resuspended in 100 ul in- 
cubation buffer (10 mM Tris-HCl, pH 8.0, ImM EDTA, 
25 mM NaCl) and incubated for 20 min. DNA staining for 
electron microscopy was principally made as described by 
Bock and Zentgraf (30). Two hundred microliters 0.2 M 
freshly prepared NH 4 OAc were mixed with 0.8 ul 1% 
cytochrome C in H 2 0 and added to the DNA/protein 
solution. Resulting solution (100 ul) was incubated on 
parafilm for further 20 min. The drop was slightly 
touched with a parlodione-coated 300 mesh copper grid 
and then directly contrasted by immersion in 90% EtOH 
with 50 uM uranylacetate for 30 s and 90% EtOH for 30 s. 
The grid was air-dried on filter paper. Grids were rotary- 
shadowed using a Pt-Pd wire with an angle of 5-7° for 
contrast enhancement. HMGAla/DNA samples were 
treated in the same way, but for higher complex count 
on the grid the DNA/protein solution was directly added 
to the grid for 30 s to 20 min, then prepared as desc- 
ribed above. Grids were analyzed with a magnification 
of 8000-10000 using a Zeiss EM10 equipped with 
a sheet film camera and Kodak SO 163 electron image 
films. 



RESULTS 

HMGA1 proteins localize in defined domains, which 
correlate with DNA compaction 

HMGA1 proteins preferentially localize to the more 
densely packed heterochromatin. We observed that tran- 
siently transfected Cos-7 cells display compaction of DNA 
as a consequence of HMGAla-eGFP overexpression. To 
analyze the nature of chromatin clustering with higher 
resolution, we used (/STORM. (/STORM relies on the 
use of organic fluorophores and does not require the 
close proximity of a second activator fluorophore (20) 
thus enabling the use of commercially available standard 
fluorescent probes in fixed (17) and living cells (18,21). 
For (/STORM microscopy, Cos-7 cells overexpressing 
HMGAla-eGFP were immunostained with Alexa Fluor 
647 labeled antibodies. Cells for (/STORM were identified 
by morphological criteria using wide field fluorescence mi- 
croscopy of eGFP with low HMGAla-eGFP expression 
showing a diffuse nuclear fluorescence (Figure la and c) 
and cells with high HMGAla-eGFP expression displaying 
distinct chromatin clusters (Figure lb and d). In addition, 
wide field fluorescence images of the corresponding 
immunolabeled cells were taken prior to (/STORM experi- 
ments (Figure la'-d'). Typically, we detected more than 
1000 photons per switching cycle from Alexa Fluor 647 
fluorophores allowing a localization precision of ~20nm 
in the imaging plane, experimentally determined from re- 
petitive localizations of individual Alexa Fluor 647 labeled 
antibodies (17). The c/STORM images (Figure la"-d" and 
a"'-d"') show a substantially improved resolution over 
the widefield fluorescence images and enable the identifi- 
cation of distinct HMGA1 domains with diameters of 
50-100 nm (Supplementary Figure SI A). Furthermore, 
the (/STORM images show that distances between 
adjacent HMGA1 domains of ~100nm can be easily 
resolved (Supplementary Figure S1A). When HMGAla- 
eGFP is expressed at low levels, these HMGA1 domains 
are evenly distributed in the nucleus (Figure la"-a"')- 
This pattern can also be observed comparable to that 
of the endogenous HMGA1 in non-transfected cells 
(Supplementary Figure SIB). In widefield fluorescence 
pictures the distribution of the overexpressed HMGAla- 
eGFP is more diffuse (Figure la) as compared to endogen- 
ous HMGA1 [Supplementary Figure lSB(a')]. 

In contrast, HMGAla-eGFP overexpression leads to 
accumulation of HMGA1 domains in chromatin clusters 
(Figure lb"-b"')- The increase in fluorescence intensity 
supports the idea of chromatin cluster formation in 
HMGAla-eGFP overexpressing cells (Supplementary 
Figure SIC). (/STORM experiments of immunostained 
DNA in cells with either low HMGAla-eGFP expression 
(Figure lc"-c"') or high HMGAla-eGFP expression 
(Figure ld"-d"') corroborate that HMGA1 -containing 
clusters consist of compacted DNA. Chromatin clustering 
through HMGAla-eGFP overexpression is not a specifi- 
city of the Cos-7 cell line and was also observed in the 
human cell line HepG2 (Supplementary Figure ID). This 
finding raises the question how this DNA compaction can 
be achieved by HMGAla mechanistically. 
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Figure 1. rfSTORM pictures acquired from Cos-7 cells transfected with HMGAla-eGFP. Widefield microscopy was used to select cells with either 
low HMGAla-eGFP expression (no chromatin clustering; a and c) or high HMGAla-eGFP expression (chromatin clustering; b and d). Widefield 
fluorescence images of immunostainings using HMGA1 and DNA antibodies are shown in a' and b' (HMGA1) and c' and d' (DNA). Note that in a' 
both endogenous HMGA1 and the fusion protein are detected by the immunostaining. Corresponding rfSTORM images are shown in a"-d". 
Highlighted boxes are shown in a"'-d"'. Scale bars are 5um in a-d" or 500 nm in a"'-d"'. Note the localization of HMGA1 in defined domains. 



DNA-DNA cross-linking activity of HMGA1 proteins 

The presence of three AT-hook binding domains and the 
capacity of a multivalent recognition of binding sites 
suggest that HMGA1 proteins can bind to A/T-tracts 
not only on the same, but also on different DNA fibers. 
Therefore, we developed a novel assay to investigate this 
assumption. The experimental strategy is outlined in 
Figure 2. For this assay, a PCR product of 563 bp con- 
taining A/T-rich mouse major satellite sequences 
(satDNA) was used. In the experiments, we immobilized 
a terminally biotinylated satDNA (bio-satDNA) on the 
surface of streptavidin-coated 96-well plates. Of 500 ng 
bio-satDNA, 50-55% were routinely bound to the 
bottom of the wells. To such pre-treated wells a Cy5- 
end-labeled satDNA (Cy5-satDNA) and 10 ug purified re- 
combinant HMG proteins were added. Coomassie 
staining of proteins after SDS-PAGE is shown in 
Supplementary Figure S2. After incubation the input 
Cy5 fluorescence was measured followed by measuring 
the remaining Cy5 fluorescence after several washing 
steps. The ratio of remaining Cy5 fluorescence over 
input was defined as RCE. In the presence of HMGA1 
proteins and precoupled bio-satDNA, the RCE was either 
20.4 ± 3.2% for HMGAla or 28.6 ± 4.2% for HMGAlb 
(Figure 3 A and E). To rule out non-specific binding of 
Cy5-satDNA control experiments were performed. To 
this end, Cy5-satDNA was either incubated in wells 
without bio-satDNA (Ctrl A) or with bio-satDNA (Ctrl 
B), but containing no proteins. The RCEs for the controls 
were 0.6 ± 0.3% for Ctrl A and 0.7 ± 0.3% for Ctrl B 
(Figure 3A). In further controls incubating Cy5-satDNA 
with either HMGAla or HMGAlb in wells without im- 
mobilized bio-satDNA we obtained RCEs of 2.7 ± 1.7% 
(Ala— Ctrl C) and 2.8 ± 1.8% (Alb— Ctrl C), respect- 
ively. These controls (protein — Ctrl C) were routinely 



run in parallel for each further experiment. Together 
these results indicate that HMGA1 proteins capture 
Cy5-satDNA through cross-linking to immoblized 
bio-satDNA. 

Significantly, Cy5 fluorescence of captured DNA was 
lost up to 98% after proteinase K treatment, supporting 
that cross-linking requires streptavidin-coupled bio- 
satDNA and HMGAla proteins. Furthermore, we also 
used HMGN1 or HMGN2 proteins, which preferentially 
bind to nucleosomes. In these experiments, the RCEs 
were also at background levels either at 2.0 ± 0.1% for 
HMGN1 (Nl— Ctrl C, 0.9 ± 0.3%) or 1.5 ± 0.2% for 
HMGN2 (N2— Ctrl C, 0.7 ± 0.1%) (Figure 3A). These 
results corroborate that the cross-linking capacity is 
HMGA1 specific. Of note, the RCE of HMGAlb was 
significantly higher as compared to HMGAla 
(P< 0.0001). The HMGAla- and HMGAlb-splice 
variants differ in 11 hydrophobic amino acids between 
AT-hooks I and II. This implies that either the hydropho- 
bicity of these 1 1 amino acids or the increased distance 
between AT-hooks I and II reduces the RCE of 
HMGAla. 

The role of the AT-hook motifs in DNA cross-linking 

Owing to the fact that HMGA1 proteins possess 3 
AT-hook binding domains, we investigated how the indi- 
vidual AT-hook motifs I III contribute to cross-linking. 
We, therefore, used the cross-linking assay applying point 
mutated proteins, where the second arginines in the minor 
groove binding core consensus R-G-R of the AT-hooks 
were replaced by glycines (1). Recombinant proteins used 
in the assay were point mutated in either AT-hook I 
(R28G), AT-hook II (R60G) or AT-hook III (R86G), 
double-point mutated in AT-hooks I + II (R28G R60G), 
1 + III (R28G R86G), 11 + III (R60G R86G) or 
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Figure 2. Principle of the DNA cross-linking assay. Streptavidin pre-coated wells are loaded with bio-satDNA. A protein/Cy5-satDNA solution is 
incubated. Input Cy5 fluorescence and, after intensive washing steps, remaining Cy5 fluorescence is detected. The ratio of nput / remaining Cy5 
fluorescence x 100% was defined as RCE. 



triple-point mutated in AT-hook I + 11+ III (R3xG) 
(Figures 3B and 4A). All RCEs obtained for these proteins 
are summarized in Figure 3E. Compared to wild-type 
HMGAla, point mutations of one AT-hook revealed 
significantly reduced cross-linking when either AT-hook 
I (R28G: 15.7 ± 1.6%, P = 0.0002), AT-hook II (R60G: 
14.4 + 4.3%, P< 0.0001) or AT-hook III (R86G: 
15.0 ± 4.6%, P = 0.0004) was mutated (Figure 3B). This 
clearly shows that binding of all three AT-hooks is 
required for efficient cross-linking. 

Double mutations in two out of three AT-hooks or 
mutation in all three AT-hooks further significantly di- 
minished the RCE as compared to wild-type HMGAla 
(P < 0.0001 for all mutants). As compared to the single 
AT-hook mutants the cross-linking efficiencies were 
further diminished in the double mutants (P < 0.0001 for 
I + II, P = 0.0056 for I + III and P = 0.0012 for II + III) or 
the triple mutant (P < 0.0001) (Figure 3B and E). Thus, 
the presence of more functional AT-hooks translates into 
an increased cross-linking efficiency. 

Cross-linking was affected mostly, when AT-hooks I 
and II were double mutated and the RCEs decreased to 
those of the triple mutant (Figure 3B and E). The effect 
was diminished when an AT-hook III mutation was part 
of the double mutant protein. The results support that 
AT-hook I and/or AT-hook II play a more important 
role than AT-hook III. This is consistent with data, 
where AT-hooks I and II were found to be the major 
players in DNA binding in vitro and in vivo (1,11,22). 
However, these data also indicate that at least one 
AT-hook is required to capture DNA independently of 
the considered AT-hook motif. Moreover, the data also 



indicate that a minor, but residual cross-linking capacity 
exists, even when all three AT-hooks are point mutated. 
This is in accordance with previous mobility studies in 
living cells, where the triple-point mutated protein also 
possesses a weak, but evident residence time on chromatin 
(1). Cross-linking assumes multivalent binding of at least 
two motifs on either DNA molecule. The results of the 
R28G, R60G and R86G mutants indicate that two func- 
tional AT-hooks are capable of cross-linking DNA. 
However, the relative high RCE values of the proteins 
containing two mutated AT-hooks suggest that one 
AT-hook plus additional sequences outside the 
AT-hooks could be sufficient to convey this function. 

Conserved basic amino acids located between AT-hooks II 
and III are required for efficient DNA cross-linking 

To seek for additional sequences we applied C-terminal 
truncated versions of HMGAla in the cross-linking assay 
(Figures 3C and 4A). HMGAla- or HMGAlb-proteins 
lacking the negatively charged C-terminal domain were 
found to bind A/T-rich DNA ~ 1.4-fold better than the 
native protein and such truncations were estimated to be 
8-10-fold more effective at introducing negative supercoils 
in plasmid DNA (12). Consistent with these findings, re- 
moval of the acidic 17 amino acids from the C-terminus 
of HMGAla (AL90) significantly increased the RCE 
~1.6-fold (37.9 ± 3.0%, P< 0.0001) (Figure 3C and E). 
Removal of 24 amino acids including AT-hook III 
(AK82) resulted in a cross-linking efficiency of 
23.5 ± 7.7% comparable to the full-length HMGAla. 
Significantly, a further HMGAla truncation (AK71) 
almost lost the cross-linking capacity and the RCE 
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Figure 3. Results of DNA cross-linking assays. Bar charts of cross-linking assays showing RCE in (A-D). Standard deviations are included. (A) 
Specific cross-linking of DNA through HMGA1 proteins (Ala, Alb). Controls to exclude non-specific binding are shown with Cy5-satDNA in wells 
without precoupled bio-satDNA (Ctrl A), in wells containing precoupled bio-satDNA (Ctrl B) or without precoupled bio-satDNA incubated with 
protein and Cy5-satDNA (protein — Ctrl C). RCEs of HMG proteins Nl and N2 reveal no DNA cross-linking (Nl — Ctrl C, N2 Ctrl C). Proteinase 
K treatment (Ala + Prot K) disrupts DNA cross-linking to 98%. (B) Cross-linking efficiencies of HMGAla-AT-hook mutants. Compared to 
HMGAla (Ala) single point mutations of one AT-hook (R28G; R60G; R86G) show significantly reduced RCEs. Double-mutants of two 
AT-hooks (R28G R60G, R28 R86G, R60G R86G) cross-link DNA with further diminished RCEs. Note that R28G R60G mutant has the 
lowest RCE. Triple-point mutation of all three AT-hooks (R3xG) again shows low RCEs. Corresponding controls without precoupled 
bio-satDNA incubated with protein and Cy5-satDNA (protein — Ctrl C) are given. (C) Cross-linking efficiencies of C-terminal truncated 
HMGAla proteins suggest the requirement of a cross-linking domain between AT-hooks II and III. Deletion of the acidic tail (AL90) of 
HMGAla leads to an increased RCE compared to the wild-type protein (Ala). Further deletion of Ala including the third AT-hook (AK82) 
gives RCEs again to wild-type levels (Ala). Further truncation decreases the RCE dramatically (AK71). Further truncation up to the second 
AT-hook (AG63) diminishes the RCE to background level. Corresponding controls without precoupled bio-satDNA incubated with protein and 
Cy5-satDNA (protein — Ctrl C) are given. (D) Point mutations in conserved amino acids between the second and the third AT-hook reduce DNA 
cross-linking. The point mutation K67G leads to a slightly decreased RCE compared to wild-type protein. Compared to wild-type protein the RCE 
of the RK73/74GG double mutant drops 50%. A triple-point mutated protein K67G + RK73/74GG shows a similar RCE as the double-point 
mutant RK73/74GG alone. Corresponding controls without precoupled bio-satDNA incubated with protein and Cy5-satDNA (protein — Ctrl C) 
were at background levels. This shows that the amino-acids R73 and K74 are essential for efficient DNA cross-linking. (E) Schemes of HMGA1 
proteins and summary of RCE. Sites of point mutations are indicated. RCE values and SDs are listed accordingly. RCEs of corresponding controls 
without bio-satDNA, but incubated with protein and Cy5-satDNA (protein — Ctrl C) are given in brackets. 



measured was 4.4 ± 3.1%. When applying a further 
truncated HMGAla protein (AG63) comprised only of 
the first and second AT-hook to the assay, the cross- 
linking activity was completely lost and dropped to back- 
ground levels (1.0 ± 0.08%). Thus, albeit functional 
AT-hooks are required to convey full cross-linking effi- 
ciency, even two AT-hooks are not sufficient when the 



sequence element between AT-hooks II and AT-hook III 
is missing. This indicates that the sequence element 
located between AT-hooks II and III is essential for 
DNA cross-linking. 

Within this region spanning the amino acids 64-82 the 
basic amino acids K64, K67, K70, R73 and K74 are highly 
conserved (Figure 4B). We, therefore, created the point 
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Figure 4. (A) HMGA1 mutations used indicated on the level of the amino acid sequence. (B) Species alignment of HMGA1 amino acid sequences of 
the region between AT-hooks II and III. Conserved amino acids are indicated (grey boxes). Compared are sequences from human (homo sapiens, 
CAI 14992), mouse (mus musculus, NP_057869), chicken (gallus gallus, NP_989700), zebra fish (danio rerio, NP_998333) and midge (chironomus 
tentans, CAA85365). 



mutated proteins HMGAla K67G and HMGAla RK73/ 
74GG where we replaced the indicated basic amino acids 
by glycines (Figure 4A). Of note, both mutated proteins 
lost the full capacity to cross-link DNA. The RCEs sig- 
nificantly dropped either to 16.7 ± 1.4% (K67G) or to 
10.1 ± 1.7% in the RK73/74GG mutant (Figure 3D and 
E). This suggests that K67 plays a minor role whereas R73 
and K74 are major players in DNA cross-linking. This is 
consistent with the cross-linking capacity of a K67G + 
RK73/74GG triple-mutated protein of 9.5 ± 0.5% and 
compares to that of the RK73/74GG double-mutated 
protein. Together the results propose that the sequence 
element between amino acids 65-74, with R73 and K74 
as an essential core is required for efficient DNA 
cross-linking by HMGAla. 

HMGAla creates DNA networks 

The capacity to cross-link DNA was also visualized at 
ultra-structural level using electron microscopy. To this 
end, 150ng of a linearized plasmid containing satDNA 
was contrasted with cytochrome C to visualize the DNA 
through the electron dense iron ion, which is part of the 
cytochrome C molecule. Spread preparations on parlo- 
dion coated copper grids revealed evenly distributed dis- 
tinct single DNA molecules of 1145.7 ± 117.5nm length 
in every sector of the grids [Figure 5A(a)]. Pre-incubation 
of 150 ng DNA with 10 ug HMGAla prior to cytochrome 
C treatment and spreading resulted in the creation of 
few DNA network patches containing loops and coils 
[Figure 5A(b-f)]. In the absence of proteins, DNA 
crossings were found randomly in ~9% and loops 
randomly in ~5% of counted DNA molecules (n = 361). 
Significantly, in the presence of HMGAla proteins the 
number of DNA crossings was increased to ~85% and 
loop formation was observed in 35% of counted molecules 
(n = 92). 



This supports that HMGAla proteins are able to cross- 
link DNA intra- and intermolecularly and are able to 
induce loop and coil formation [Figure 5A(c-f)]. 
Notably, an evenly distributed DNA network was also 
created after pre-incubation with the mutant HMGAla 
bearing the point mutation in each AT-hook (R3xG) 
[Figure 5A(g)]. Thus, weak binding of AT-hooks still en- 
ables DNA cross-linking. This is in accordance with the 
findings obtained with the cross-linking assay presented 
above. However, loop formation was not observed 
indicating that creation of loops requires both, 
AT-hooks and the cross-linking domain of HMGAla. 



DISCUSSION 

f/STORM suggested that HMGA1 proteins localize con- 
centrated in defined domains with a maximal diameter of 
50-100 nm. The localization in defined domains is consist- 
ent with previous findings that HMGA proteins localize to 
A/T-rich G/Q- and C-bands of human and mouse meta- 
phase chromosomes (23). Together with our findings, this 
implies that the localization of HMGA proteins in defined 
chromosomal regions persists throughout the cell cycle. 
Nevertheless, in cells highly overexpressing HMGAla, 
the DNA is compacted in clusters, which contain a 
crowd of spatially compacted HMGAl-containing dots. 
Likewise, calculation of the labeling intensities revealed 
that HMGA1 molecules accumulate in defined domains 
within clusters of tightly concentrated DNA. We, there- 
fore, asked whether HMGA1 proteins could be respon- 
sible for DNA compaction and assumed that HMGA1 
proteins could mediate or stabilize a compacted DNA 
scaffold through cross-linking of DNA. 

Using a specifically developed cross-linking assay we 
confirmed this assumption. This assay revealed that effi- 
cient cross-linking requires at least some low affinity DNA 
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Figure 5. (A) Electron microscopic analysis of HMGA1-DNA interaction using DNA spread preparations. Naked DNA is shown in a. DNA 
incubated with HMGAla is presented in pictures b-f. Note the DNA cross-linking in b and loops and coils in c-f. DNA incubated with the R3xG 
AT-hook triple mutant is shown in g. Low DNA binding and an intact region between AT-hooks II and III are sufficient to create a DNA network. 
Bars in a, b and g are 500 nm. Bars in c-f are 250 nm. (B) Models of DNA cross-linking through HMGA1. Either a clamp- or a push-button model 
is conceivable. In the clamp-model HMGA1 molecules bind to DNA with their AT-hooks, whereas the cross-linking domain-containing R73 and 
K74 overstretches an adjacent DNA fiber (a and b). The third AT-hook either contacts the same DNA molecule than AT hooks I and II (a) or 
contacts the second DNA molecule (b). In the push-button model, the cross-linking domain contacts a neighboring DNA fiber (c). 



binding either of one functional AT-hook or three weakly 
binding mutated AT-hooks. In addition, an essential 
sequence element between AT-hooks II and AT-hook III 
containing the conserved basic amino acids R73 and K74 
is crucial. We, therefore, suggest that this region plays an 
important role as cross-linking domain of HMGA1 
proteins. Thus, in the wild-type protein both the 
AT-hooks and the cross-linking domain act together to 
convey full DNA cross-linking activity. The cross-linking 
activity is intrinsic and does not require energy. Our 
finding of this novel function of HMGA1 proteins is in 
full agreement with previous findings that HMGA 
proteins recognize and bind to non-B-form DNA 
including four- way junction DNA (14), bent and super- 
coiled DNAs (12) and distorted DNA on isolated nucleo- 
somal core particles (16). 

Soluble HMGA molecules are unstructured disordered 
random coils and only adopt a defined conformation 



when bound to DNA or chromatin assuming a kind of 
flexibility in selecting their binding substrates (10). Such 
flexibility has to be assumed for any cross-linking activity. 
Potential models of how HMGA1 proteins could 
cross-link adjacent DNA fibers are summarized in 
Figure 5B. Combining our results and previous findings 
either a clamp- or a push-button model are likely. In the 
clamp-model HMGA molecules bind to DNA with their 
AT-hooks, whereas the cross-linking domain containing 
R73 and K74 overstretches an adjacent DNA fiber 
[Figure 5B(a-b)]. In the push-button model, the 
cross-linking domain could loop out and contact a neigh- 
boring DNA fiber [Figure 5B(c)]. When unraveling the 
solution structure of AT-hooks bound to the minor 
groove of DNA, it previously was found that only the 
RGR core peptide fits deep into the minor groove (10). 
Six amino acids C-terminal of the core, including K65 and 
K67, only present C-terminal to AT-hook II, were found 
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to interact with the sugar-phosphate backbone on either 
edge of the minor groove (10). It is conceivable that in 
addition the basic amino acids K71, R73 and K74 simi- 
larly contact the sugar-phosphate backbone to capture 
adjacent DNA molecules. This would be consistent with 
both, a clamp- or a push-button model. 

However, several previous findings rather support the 
clamp model. Binding of HMGA1 to four-way junction 
(4H) DNA occurs via contact of two out of three 
AT-hooks on opposite arms of the branch point and 
contacts across the 4H DNA branch point (14). A 
subfragment containing AT-hooks II and III plus the 
intervening peptide binds to 4H DNA with about 
the same affinity as the full-length protein (14). Thus the 
contacts over the branch point occur within the sequence 
between AT-hooks II and AT-hook III, which we also 
found to be relevant for the cross-linking activity. In 
addition, findings that HMGA1 proteins are able to intro- 
duce supercoils (DNA crossings) in plasmid DNA in vitro 
without requiring energy (4,12) are consistent with a 
clamp like model where DNA crossings, as occurring in 
supercoils, are stabilized. In support, supercoil-like DNA 
crossings were also observed in our electron microscopy 
studies using linearized DNA. 

The capacity of DNA cross-linking has implications for 
several architectural functions that have been described 
for HMGA proteins. For example, together with histone 
HI and topoisomerases II HMGA1 co-localizes at 
scaffold attachment regions (SARs). SARs are thought 
to be A/T-rich DNA elements at the base of large 
gene-containing DNA loops and considered as structural 
backbone of metaphase chromosomes (24). Interestingly, 
we found loop formation in linearized plasmids in our 
electron microscopy studies, which is in full agreement 
with these previous studies and with a localization of 
HMGA1 at the base of DNA loops. 

Biochemical results demonstrated a multivalent DNA 
binding to the IFN-y promoter (13) and HMGA1 is a 
key player in coordinating enhanceosome organization 
during the on/off regulation of interferon transcription 
[for a summary see (4)]. Interestingly, acetylation of K65 
destabilizes the enhanceosome, whereas acetylation at 
K71 enhances IFN-y transcription (25). Moreover, it 
was recently described that the histone acetyl transferases 
PCAF and p300 also acetylate K67 and K74 of HMGA la 
(26). Since, especially these lysines are involved in DNA 
cross-linking it is conceivable that DNA cross-linking and 
its regulation through acetylation could be involved in 
stabilization/destabilization of the enhanceosome organ- 
ization. Within this context, it is noteworthy that these 
lysines are hyperacetylated in PC-3 human prostate cancer 
cells and increased HMGA la acetylation was observed in 
metastatic breast cancer cells (27). A hallmark of malig- 
nant tumors is HMGA1 overexpression. Therefore, it is 
intriguing to speculate that HMGA1 induced DNA 
cross-linking and a modulation through acetylation may 
influence DNA organization in tumor cells. 

HMGA1 proteins are involved in the organization of 
heterochromatin, influence global chromatin structure 
during apoptosis or cellular senescence and are involved 
in chromosome architecture and compaction. It is 



conceivable that the spatial organization of a DNA 
scaffold through the cross-linking capacity of 
HMGA1 proteins plays a significant role in these 
processes. Moreover, besides HMGA1 proteins addition- 
al chromatin proteins could possess a similar intrinsic 
DNA cross-linking activity using identical or different 
domains. 
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